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(54) Abstract Title: Voltage margin testing system for computer systems 

(57) The present invention provides a voltage margin 
testing system incorporated in an electronic system 
[1081, such as, a computer system (e.g., a server), 
having a plurality of components [1 141 for at least 
some of which voltage margin testing is required. A 
voltage margin testing of the invention can include 
a controller [56], such as a Baseboard Management 
Controller (BMC), internal to the computer system 
and a digital voltage adjuster [110], e.g., a digital 
potentiometer, that is in communication with the 
controller [56]. The voltage adjuster [110] can effect 
generation of one or more test voltages, for 
example, by varying resistance in a feedback 
circuitry of a regulator whose output voltage is 
applied to system components, for application to 
the components in response to commands from the 
controller [561. 



FIB. 10 



DfTRUKKf BU 


SAiaUTEHGHT 




GWIffllB) 





FPU 



Btt \m~JL, 



IIS 

/ 



"5« 

OI&ITAL ALIAS 
ADJUSTS 



32 



VOLTAGE REGULATOR 
VOUT m 



112 



ill 



CD 

CD 



CO 
CO 

ro 

O) 



Original Printed on Recycled Paper 



1/17 



FIG. SA 



10 



__1 

EXTERNAL 
SYSTEM 



I 



IB 



I 



CONTROLLER [ L 



ZIKJ 



HAflDHARE 
HOWTOfl 



3M 



SENSORS 



HARGDMBLE SYSTEM 
COHWNIS 



OS 

HANAGBCNT AGENTS 
HOST PROCESSOR 



12 



DIGITAL 

adx 


PARAMETER 
ESTER 






MODULE FOR 
GENERATING 
SELECTED 
OPERATING 
PARAMETER 



-22 



-30 



2/17 



FIG. IB 




3/17 



FIG. 2 



10 




40 



STAND-BY 
POWER 


14 

/ 




MMfTTW 1 CO 




WLU*n 





POWER 






CONTROL 





I 



OS/ AGENTS 



42 





VGB 


r 1 









FCB 


,/db 












FBB 







4/17 



FIG. 3 



40 





! / 


EXTERNAL 
SYSTEM 




STAND-BY 
POKR 



14 



CONTROLLER 



3Kr 



FCB 



FBB 



OS/ AGENTS 



POKER 
CONTROL 



HAJCWRE 
MONITOR 



3fsj ■ r 

-j YCB ~ 



> 



HAflGINABLE 
VOLT AGE RAILS 



5/17 



FI6. 4A 

42 

, L 



f 


46 

/ 


POUER CONTROL 
ELEMENT 


CONIROLLER 




PflOGRAHHABLE LOGIC DEVICE 




1 












HARDWARE 
HOfCTOfl 



6/17 



1 



FIG. 4B 



CONTROLLER 



12C/SEHIAL 
LINK 



mi BYPASS 

block m 



W/WEST.0UT- 
TKERHi- 



SMLATED TACH 




SHBCUC- 
FANl/AINi- 
- FAN2MBB- 
CI- 
6ND- 
Y»- 

, INr#/ALEHT#H 
ACOut/NTESUN- 
RESET* • 



52 



20 



2 

23 



22 
21 
20 

6 LHB719 
lfl 



7 
B 
9 
10 
11 



12 13- 



17 
16 
15 
14 
13 



-WDO/DWO 
-VID1/IBQ1 
-VTO2/IR02 

— nra/iBoa 

-WM/IB04 
-Vccpl 
-♦2.5Wn/02* 
-Vccg2/D2- 

♦12Vin 
Olf 
01- 




FB8 MASKED FAULT 




7/17 



FIG. 5 

54- 



INTELLIGENT FRJ 



^62 



SATELLITE HGHT 
CONTROLLER 



SE 



BMC 



•IB 



HOST PROCESSOR 
K6NT AGENTS 

r 

12 



I 



FRU 



SEEPROH 



-84 



1 



1 



HARDWARE 
MONITOR 



60 

jL 



MOTHERBOARD 
SEEPROH 



06 





12C INTERFACE 


CRYSTAL 
OSCILLATOR 


\ 


66a 


\ 

Eft 



SENSORS 



OUTPUT 



HAR6INABLE SYSTEM 
COMPONENTS 

* 

6B 



8/17 




9/17 



FI6. 6B 



CONTROLLER 







/14 




9K 



10/17 



FIG. 7 



INTELLIGENT FRU 


SATELLITE HGHT 




CONTROLLER 





SEEPROH 




11/17 



FIG. 8 






hardwire 




MOTHERBOARD 




MONITOR 




SEEPROM 



SENSORS 



SYSTEM 
COMPONENTS 



CLOCK SIGNAL 



ol i2 



r, 



on 



FREQUENCY 
SYNTHESIZER 



REF 



CUC OUT 

HZ 



Bn 

PATTERN 

FOR FREO 
SELECT 



-SB 



CLOCK 
SOURCE 

TAU 



12/17 



FIG. 9A 



DEFAULT SIGNAL C 
FROM OPU 



100 






MUX 


■ 0 


SEL 


1 




l 


MUX 


0 





104 

' L 

mm 



106 

t 

CLOCK 
GENERATOR 



13/17 



FIG. 9B 



innmrnm testing, hth hain power ot. 

BY SENDING A COMMAND TO BHC TO CAUSE 
BHC TO LOOP THOUGH A SET OF CtWWCS 
TRANSMITTED TO THE FREQUENCY SYNTHESIZER. 
^ OSS F^rc SYNTHESIZER 
TO GENERATE A TEST FREQUENCY FOR 
APPLICATION TO FSB 



^A 



HAIN POWER SUPPLY OF SERVER 



ESaS J^ i&t 1 ™ ro DETERKENE RESPONSE 
OF SERVER TO THE TEST FREQUENCY 







1 SNITCH OFF RAIN PONER SUPPLY j 






REPEAT STEPS B THROUGH C FOR OTHER 
TEST FREQUENCIES 



1-4/17 



FIG. 9C 



DC S£I M. flifflMOTs output j^* 



EXECUTE A DIAGNOSTICS SOFTWARE ID 

aspects OF SEflO'S orewnoN T TIC 

ItSI VALlt 



mm t mmm to oetemne wcna 



BBT !E,{EJ 0 SET SINTHESIZEH'S OUTPUT L^D 
EBEOIBICT ID ANMCR TEST VALUE ^ 



REPEAT STEPS B THROUGH C FOR ALL 
TEST VALUES 



r 



15/17 



FIG. 10 
108. 



INTELLIGENT FRU 


SATELLITE HGHT 




CQKIROLLEfl 





56 

11 



BMC 



1 



OS 



I 



HARDKAHE 
MONITOR 



SENSORS 



FRU 



SEEPROH 



MOTHERBOARD 
SEEPROH 



110 



1ZC INTERFACE 
110a 



DIGITAL VOLTAGE 
ADJUSTER 



VOLTAGE REGULATOR 
VOUT VIN 



HARSTXABLE 
COMPONENT 



112 



16/17 



I 



a 





X 



cn 

s 



snaaoflflfloflflni 



3 



Si 



r— IH 1 



i 



ru cn ««■ in a n B w g ^ £J 

JDunnnnj 



ira 



ft 



nu 



i 



8 



i 



Be 



= .5 



* * 3 



2403826 

USE OF f CEASED POTENTIOMETERS TO ENABLE VOLTAGE RAIL 
VARIATION UNDER BMC CONTROL 



Rejaj 



Thepresent application isrelatcd to the following conmwn* owned U.S. Patent 
Apphcahons, incorporated in their entirety herein by reference: 

U.S. Patent Application entitled "USB OF I 2 C PROGRAMMABLE CLOCK GENERATOR 
TO ENABLE FREQUENCY VARIATION UNDER BMC CONTROL," naming as 

200208055-1); US. Patent Application entitled "METHODS AND SYSTEMS FOR 
MASKING FAULTS IN A MARGIN TESTING ENVIRONMENT' naming as inventors 
Benjamu, T. Percer and Naysen J.Roberston (Attorney Docket No.: 200312936.1); and US 
Patent Application entitled "METHOD AND CONSTRUCT FOR ENABLING 
PROGRAMMABLE, D^TEGRATED SYSTEM MARGIN TESTING" naming as inventors 



The present invention relates generally to systems and methods for voltage margin 

7 *— such as a computer system. mZ 

particularly, the invention relates to the use of I 2 C based notenH*™, , ' 
variationunderBMCcontroL ^Potentiometers to enable voltage rail 

Electronic systems often include a myriad of subsystems and components that require 

within specified operating conditions. Many of J 
components typ,cally exhibit subtle failures at margins or extremes of such spec** 
^ conditions * is thus desirable to test the components at these m^Te^ 
referred to as margm testing, to evaluate their reliability at the extreme JT 

iTtL ^ "* •** voltage, dHvta, 
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variability. Margin tasting can also ensure that a particular design can be readily adapted to 
evolving changes in manufacturing processes. 

For example, testing a computer system's component by systematically varying a 
voltage applied thereto can provide valuable rafoimation about the reliability of that 
component over its associated voltage tolerance range, and especially at the extremes of this 
range. A number of systems and methods are known in the art for performing such voltage 
margin testing. For example, in one such traditional method, feedback resistors in a voltage 
regulator are replaced in order to vary an output voltage of the regulator that is applied to one 
or more components for which voltage margin testing is desired. This technique is not only 
time consuming but it is also invasive in that one ore more resistors need to be physically 
replaced. Such resistor replacement may, however, lead to accidental damage of the system 
under test (SUT) and/or unreliable test results. 

In another conventional approach, a plurality of jumpers or switches are employed to 
modify resistor values of feedback voltage dividers utilized in voltage regulators. This 
approach is, however, time consuming, and it also suffers from low granularity of available 
voltage variations. It further requires the use. of valuable board space for incorporation of 
jumpers and switches. 

Another conventional approach provides a voltage rail, which supplies voltage to 
system components, with a trim input to which an analog voltage can be applied to vary the 
rail's voltage. This approach, however, requires external software-controlled test equipment 
for supplying the analog voltage with concomitant expense and time needed for procurement 
and setup of the test equipment. 

in yet another traditional testing technique, the internal power rails in a system under 
test are disconnected and an external voltage source is utilized to apply power to the rails. 
The external voltage, and consequently the voltage of the internal rails, can then be varied for 
testing. Similar to the previous traditional approaches, this technique also suffers from a 
number of shortcomings. For example, it requires expensive external test equipment. 
Further, it is invasive, and hence prone to accidental damage to SUT. In addition, the 
obtained test result may not accurately reflect the characteristics of the SUT. 
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Hence, there is a need for systems and methods that allow reliably performing voltage 
margm testing of components of a computer system. There is also a need for snch systems 
and methods that allow accurate voltage margin testing without a need to physically modify 
the system under test y 

Summary ofH. n T-y^fi,,,, 

In one aspect, the present invention provides a voltage margin testing system 
incorporated in an electronic system, such as, a computer system (e.g., a server), having . 
^^•-^^ir.la^^afi^^^^,,-^ A 
vohage margin testing of the invention can include a controller, such as a Baseboard 
Manag«nent Controller (BMC), internal to the computer system and a digital voltage 
adjuster, e.g., a digital potentiometer, that is in commotion with the controller The 
voltage adjuster can effect generation of one ox more test voltages, for example, by varying 
^stance in a feedback circuitry of a regulator whose output voitage is apphed to system 
components, for appHcatian to the comports in response to commands from the stroller. 

Inanotheraspect, ••IW*^,^^.*^^^. 
processor and a plurality of components in communication with said processor for 

the system components. The voltage adjust* can affect generation of one or more test 

In further aspects, the invention provides a method for voltage margin testing of one 
or more components of an electronic system, having an internal controller, and a digital 
voltage adjuster, in communication with the controller and with at least a power rail 
TOvoltagetotnecomponents. The method includes a step ofcausing the control to 
tnmsnut one or more commands to the voltage adjuster to cause the adjuster to affect 
^eranonofoneormore test voltages at saidpower rail for apphcation to said components 
The response of the system is then monitored at each of the test voltages. 



3 



J 



Brief Descri ption of the Drawing 

FIGURE 1 A schematically depicts a margin testing system according to one 
embodiment of the teachings of the invention incorporated into a computer system for testing 
selected components thereof; 

FIGURE IB is a flow chart depicting the steps in one embodiment of a method of the 
invention for margin testing of a selected operating parameter of a computer system, 

FIGURE 2 schematically depicts a computer system in which a margin testing system 
according to one embodiment of the invention, having a frequency control module, a voltage 
control module and a fault bypass module, is incorporated, 

FIGURE 3 schematically depicts that a voltage control module of the testing system 
ofFIGURE 2 can be utilized for voltage margin testing of selected components of the 
computer system, 

FIGURE 4A schematically depicts an exemplary implementation of an FBB module 
according to one embodiment of the invention, 

FIGURE 4B schematically depicts the use of an FBB module in combination with a 
hardware monitor to made selected faults during margin testing of a computer system in 
which a margin testing system according to one embodiment of the invention is incoiporated, 

FIGURE 5 schematically depicts the incorporation of a margin testing system 
according to one embodiment of the invention in a server employing an IPMI protocol, 

FIGURE 6A is a schematic diagram of a frequency synthesizer suitable for use in the 
margin testing system according to the teachings of the invention, 

FIGURE 6B is a schematic diagram of an exemplary implementation of a frequency 
margin testing system according to one embodiment of the invention, 
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FIGURE 7 schematically depicts the use of a frequency synthesizer whose output 
frequency can be adjustedby an input bit pa«em in a margin testing system of the invention, 

FIGURE 8 schematically depicts a margin testing system according to one 
embodiment of the invention in which an I>C-based I/O expander is incorporated, 

FIGURE 9A schematically allustrates an embodiment of a margin testing system of 
Ae invention that utilizes an I 2 C-based I/O expander and multiplexers to ensure that default 
frequencies are applied to selected components in the absence of instructions from a BMC 
controller or in the event of circuit errors), 

FIGURE 9B is a flow chart depicting various steps in one embodiment of a method of 
the invention for frequency margin testing of a computer server, 



HOURS 9C is a flow chart depicting various steps in another embodiment of a 

server, 



■WW of the taraata fo, ffequ^cy nu»gin fcajng rfa 



one fflbodnBcnt of Hie invention for adjusting the regulators' output voltages, and 

Detailed Dgggfetfai 

The pa«™ in vration relaes geae^v to ^ 

(r, . seaver) o, . ne^^ ^ * 

— . - invent c^,^ jr^r 

ofaco^fc. ^fc^..,^. ^ 
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of interest, e g., frequency or voltage, associated with selected components of the computer 
system through a plurality of test values in response to commands from the controller. More 
particularly, the output of the parameter adjuster, and hence the value of the operating 
parameter applied to one or more components under test, can be varied over a selected range, 
via command signals from the controller, and the response of the system can be collected, 
monitored and/or analyzed. 

Although the following embodiments of margin testing systems of the invention are 
described with reference to computer systems, it should be understood that margin testing 
systems according to the teachings of the invention can also be incorporated in other 
electronic systems, such as, network switches, for which margin testing is needed. 

FIGURE 1 A schematically illustrates an exemplary computer system 10 in which a 
margin testing system according to the teachings of the invention is incoiporated. The 
computer system 10 can be, for example, a server computer system which is generally 
understood in the art to be a system configured, by hardware and/or software, to pro 
high degree of performance in communications with other computer systems over a 
communications network, or any other computer system for which margin testing is needed. 
Although the exemplary computer system 10 includes a single host processor 12, it should be 
understood that a margin testing system according to the teachings of the invention can also 
be incorporated in multi-processor systems. 

The exemplary computer system 10 includes a controller 14 mat can provide a 
plurality of management functions, as described below, and is in communication, via a 
system interface 16, with the host processor 12 on which an operating system (OS) and one 
or more management agents run. The system interface 16canbe, for example, any suitable 
communications bus, such as a PCI bus. 

The controller 14 can be implemented, for example, as an application specific 
integrated circuit (ASIC), or alternatively, it can consist of several different chips. By way of 
example, in some embodiments of the invention described in more detail below, the 
controller 14 can be an intelligent processing controller, commonly referred to as Baseboard 
Management Controller (BMC) that can support Intelligent Platform Management Interface 
(IPMI) protocol. The IPMI protocol is an open standard that provides a standardized 
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message interface between a management application running on a host processor and the 
hardware platform. 

The exemplary controller 14 can communicate, via a communications bus 1 8, with a 
hardwa^mointormoduleaOandadigitaJparameteradjustff . 
command signals to these modules and/or to receive information therefrom The 
communications bus 18 can be any suitable proprietary or public bus. For example, in 
embodiments in which the controller is BMC, the bus 18 can be a private*? Outer- 
Integrated Circuit) bus or an Mgent Platfimn Management Bus (1PMB). 
the bus 1 8 can be an ASA or a USB bus, or any other suitable communications bus. 

Moreover, the controller 14 can communicate with an external system 24, via a bus 
26,thatc m instructthecon^ The external 

system 24 can be, f or example, a terrmnal that can cunununicate wrih the controller via a bus 
such as, 3**5232 bus. AllB^a.^^*^^ ' 
excommunicate with me controller 14 via a computer network connection, such as, a LAN- 
based Efoernet connection. Thebus26c OT beanysmtablebus,suchas.aIAN^ 
Ethernetconnection. The controller cm also initiate n^te^m^*^^ 
switch or a jumper. 

^^^ermclud^ 
coopemtvefyp^^ Many of mese subsystems m components 

remure storing and/or testing during development, manufacturing and/or in the field to 
ensure proper design and/or operation of the computer device. More specifieaUy, many of 
foeseconmonentsreuu^ 

cond.oons. Such components 28 for wmch margin testing is desired, herein referred to as 
margmabiecomponents, can include, for example, central proc^smg units (CPU), memory 
modules, mtemal communication buses, voltage regulators, or any other component or 
subsystem of components of interest for which margin testingmay be required. 

mo reo r***T eter ^^ 

momofthema^ablecomponents 28 directly, , g ., to adjust clock frequency, or via one or 
more mtermediate modules 30 that eeae™* » „>i~.~, 

8 a 5elected °P eratui g parameter for application to 
•hesecmnponcnts. ^^^^o^^^JJ^^* 
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which the parameter adjuster is a digital potentiometer, the intermediate module can be a 
voltage regulator whose output can be adjusted by varying the resistance of the digital 
potentiometer under commands from the controller. 

The hardware monitor 20 can monitor the components in real time through sensors 32 
associated with specific component properties, e.g., voltage, temperature, operating 
frequency, etc. The sensors 32 can generate data indicative of the response of the 
components 28 to variation of one or more operating parameters, such as, temperature, 
voltage, or driving frequency. The hardware monitor 20 receives this response data, and can 
transmit the data to the controller 14 for analysis, as discussed in more detail below. 
Although in this schematic illustration, the sensors 32 and the hardware monitor 20 are 
shown as separate modules, those having ordinary skill in the art will appreciate that some or 
all of the sensors can be integrated in the hardware monitor. 

With continued reference to FIGURE 1A, the digital parameter adjuster 22 can effect 
variation of an operating parameter associated with one or more of the marginable 
components, either directly or via the intermediate module 30, over a selected range of 
values. More particularly, the controller 14 can transmit command signals to the digital 
parameter adjuster 22 to instruct the adjuster to vary the value of a selected operating 
parameter associated with one or more of the components 28. 

For example, with reference to the flow chart of FIGURE IB, in step A, standby 
power is applied to the system under test with the system's primary power source off. In step 
B, a "Margin Mode SeT command is transmitted to the BMC, eg., from an external system, 
to instruct the BMC to initiate margin testing. Upon receipt of an acknowledgement from the 
BMC, a "Margin Value Set" command is transmitted to the BMC to instruct the BMC to set 
the value of an operating parameter unto test, e.g., voltage or frequency, to a test value (step 
Q. Step C can be rq>eated until all margin parameter values have been transmitted to the 
SUT and respectively acknowledged. Subsequently, in step D, a "Margin Start Command" is 
transmitted to the BMC to cause it to power the system, i.e., switch on the system's primary 
power source. In step E, the progress of the test is monitored and logged. Upon completion 
of the test at this test point, the primary power is switched off (step FX and die above 
procedure is repeated for other test points, if desired, until data at all test points are collected. 
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computer system that are capable of operating independently of operating system's (OS) 
control and/or intervention. If needed, the controller 14 can communicate with these 
management agents to provide in-band system monitoring. 

Typically, out-of-band operation is preferable for performing margin testing of a 
computer system because the system's OS and its agents can be susceptible to crashes and 
other aberrant behavior under stresses associated with margin testing. It is desirable to 
monitor and log the progress of a margin test For example, if a Mure occurs at a test point, 
it is desirable to log information regarding the test point and other related data. Anout-of- 
band agent, such as a BMC that is powered by a non-margined voltage rail, e.£, a stand-by 
power source, will not be affected by system level margin settings, and hence will be 
available to perform such monitoring and logging of a margin test. 

A margin testing system according to the teachings of the invention can be 
implemented in a variety of different ways to allow programmable integrated margin testing 
of a computer system, e.g., a server. By way of example, FIGURE 2 schematically illustrates 
one embodiment of a margin testing of the invention incorporated in the computer system 10 
that includes, in addition to the controller 14, a voltage control block/module (VCB) 34, a 
frequency control block/module (FCB) 36, and a Fault bypass block (FEB) 38. In response to 
commands from the controller 14, the VCB 34, the FCB 36 and the FBB 38 can be employed, 
respectively, for voltage Margin testing, frequency margin testing, and for sdectiv 
masking automatic mechanisms integrated in the system under test (SUT) for responding to 
faults during margin testing. Although this exemplary margin testing system includes both a 
frequency and a voltage control block, other embodiments may include only a voltage control 
module or a frequency control module. 

Each margin testing block 34, 36, and 38 incorporates devices and associated circuitry 
required for performing margin testing of selected components of the server under control of 
the controller 14. Exemplary implementations of each of these modules are provided further 
below. 

With continued reference to FIGURE 2, the controller 14 can communicate with each 
of the VCB, FCB, and FBB modules via the bus 18 to transmit commands thereto. Hie bus 
1 8 can be any suitable bus for providing communication between the controller and these 
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modules. Forexanmle,msomeembo*m^ 18is 
an I 2 C private bus. In addition, the controller 14 can communicate via the system interface 
16, e.g., a PCI bus, to the server's operating system and one ormore management agents. 

A stand-by power source 40 can provide power to the controller 14 to ensure that the 
controller can function when the system's primary power source (not shown) is switched off. 
In addition, the stand-by power source 40 can supply power to other dements, such as VCB 
34, FCB 36, and FBB 38, that participate in margin testing of the computer system. Further, 
the controller 14 can transmit commands to a power control circuitry 42 via the bus 18 to 
contarf switchi^ 

The external system 22, which can be, for example, a user or a script entity, can 
transmit commands to the controDer 14 for initiating margin testing of the server. More 
particularly, the external system 22, via a user or a preprogrammed instruction set, can 
transmit acommand to the controller 14 to cause fce controller tomtit 
selected components of the server. Such a margin test is typically initiated wimtheprimary 
power ofX and with the stand-by source providing power to the controDer, and to the 
ancillary margin testing blocks, e.g., the VCB 34, the FCB 36, and the FBB 38. In response 
to commands from the external system 22, the controller transmits command signals to one 
or more margin testing blocks, such as, the VCB, FCB, and/or FBB to effect resumption of 
testing of marginable components of the senrer. TypicaUy, me wmtroUer 14 inst^ 
FBB 38 to mask selected faults during the performance of the margin test, as discussed in 
more detail below. 

m many embodiments of the invention, the controller 14 includes firmware that can 
be programmed to step the voltage or the frequency apphed to marginable components of a 
system under test through a discrete number of pre-defined values, upon initiation of margin 
testmg. Alternatively, upon initiation of margin testing, the external system 22 can transmit a 
senes of commands to the controller, each of which can instruct the controller to set the 
frequency or voltage to a desired test vahre. At each value of the voltage or frequency, the 
system's response can be monitored and analyzed. 

With continued reference to FIGURE 2, in response to commands from the controUer 
the margm test module 36 can adjust clock frequency applied to selected components, such ' 
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as, CPUs or synchronous buses, and the VCB module 34 can adjust voltages of selected 
power rails, as discussed in more detail below. For example, the FCB 36 can step the clock 
frequency through a number of discrete values spanning a selected range, and the VCB can 
step voltages of selected rails through a discrete set of values. At each value of the clock 
fiequency or the rails voltage, the response of the system can be monitored and recorded. 

m preferred embodiments of the invention, components and subsystems for which 
inargmtestmgcanbeperfonned,i^ 

instructed, for example, by the controller 14, to do otherwise, For example, rails voltages 
default to nominal values unless programmed, for example, via the VCB, to deviate from 
thesevalues. Furthermore, these default values can be re-set when the system power is 
cycled. ' 

With reference to FIGURE 3, the VCB module 34 can be employed to adjust voltages 
of selected raib 44. herem also refers 
test commands from the controller 14. 



one 



The voltage control block 34 can be implemented in a variety of different ways. In 
such implementation, which is described in more detail below (See FIGURES 1 0 and 
1 1), the VCB 34 can include a digital potentiometer that is incorporated into a resistive 
feedback ciromry of a voltage regulator whose output corresponds to a rail voltage. In 
response to c<nnmands from the contreUer, me digital iwtentiometerca^ 
fceregulator'sfeedbackci^^ 

Referring again to FIGURE 2, the FCB 36 module can also be implemented in a 
variety of different ways. For example, in one implementation described in detail further 
below with reference to FIGURE 5, the FCB 36 can include a digital frequency synthesizer 
whose output frequency, which can be applied to selected margmable system components, 
can be varied in response to commands from the controller, fo this manner, one or more 
margin test frequencies can be applied to system components, such as, CPU's, for which 
frequency margin testing is desired. 

With continued reference to FIGURE 2, the fault bypass block 38 can mask selected 
faults during margin testing in order to ensure that automatic response fault mechanisms 
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integrated into the computer system 10 would not adversely affect margin taring of the 
system. Such automatic response fault mechanisms can provide environmental safeguards 
for example, temperature monitoring via diodes, or relate to over/under-voltage "power- ' 
good" reset circuits, or any hotswap "healthy" outputs that may cause a system reset, or other 
smular mechanisms. As discussed in more detail below, the FEB 38 can employ digital 
enable/disable signals to disable selective fault lines during margin testing, and re-enable 
them once the test is completed. Similar to the other margin testing modules described 
above, the FEB can receive power fiom the stand-bypower source to bo able to operate when 
the main power source is off for margin testing. 

Bywayof example, with refereiK^ 
36 can include a programmable logic device (PLD) 46 that receives signals from the 
controller to disable selective automatic fault response mechanisms. For example, the 
contmU^c^h^ctthePL^ 

intercept and mask selected fault interrupts that can be generated in the system under test In 
Ous example, the PLD can communicate with a hardware monitor 20 to receive/mtercept 
signals ttat are noinudlyinmcatrveof^ 

.gnals when margin testing of the computer system is in progress. For example, as discussed 
rcmoredetaubefow.wh^oper^ 

signals to thepoweroontrol element42 to ensu« that it wih not power down d« computer 
system when voltage nrargm testing of selected power rails of me computer system is in 

progress In the absence of margin testing, mat is, when toe PLD is not opemting in nungin 

mod e , rt wmpassfeult a g^ 

element 42 to ensure that appropriate actions will ensue when a valid voltage fault occurs A 

F^exan^aPLDmarW^ 

the trade designation MAX 7000B can be employed. 

FIGtm^!?" illUStrati ° n ° f * e impICmCntati0n Md »»*»*» of the EBB module, 
FIGURE 4B depots that the FEB module 38 communicates with the controller 14 and the 
hardware monitor 20, which in this example is selected to be an integrated circuit marketed 
under the trade designation LM87 by National Semiconductor company of Santa Clara 

hardwaremomtoringofvanouscom^ 
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For example, the LM87 can be employed to monitor power supply voltages, motherboard and 
processor temperatures, and fan speeds. The LM87 includes a serial bus interface that is 
compatible with an I J C bus, and hence can communicate with the controller 14 via an I 2 C bus 
in embodiments in which the controller 14 is a BMC, or a similar device with comparable 
functionality. 

With continued reference to FIGURE 4B, the FBB 38 can affect various functions of 
the LM87 hardware monitor, for example, voltage monitoring, temperature monitoring, and 
fan speed control. For example, in the absence of voltage margin testing, that is, during 
normal operation of foe computer system, an oufoutpfooffoe Undesignated as 
INT#ALERT# can generate an interrupt signal when foe voltage of a system's power rail, 
which is monitored by foe LM87, varies by more than a selected amount, e.g., 5 percent, from 
its nominal value. In foe absence of foe FBB module 38, this interrupt signal is typically fed 
to foe power control element 42 to cause it to take appropriate actions, e.g., power down foe 
computer system. 

However, in this example, foe FBB 38 receives this interrupt signal. If no voltage 
margin testing of the computer system is in progress, foe FBB transmits foe interrupt signal to 
foe power control element 42 so that appropriate actions can be taken in response to a voltage 
fault. However, during voltage margin testing of a power rail monitored by foe LM87, foe 
rail's voltage may be varied more than a threshold that would normally cause a voltage fault 
For example, it is customary to vary anril's voltage by mi)re than 5 percem for voltage 
margin testing thereof Thus, during voltage margin testing, foe FBB 3 8 operates in margin 
mode, eg., in response to a command from foe controller 14, and "masks" foe interrupt signal 
generated by the LM87 from the power control element fo other words, foe FBB, rather than 
transmitting foe interrupt signal received from the LM87, provides foe power control element 
42 with an appropriate signal level indicating that no faults have been detected. Such 
masking of foe interrupt signal ensures that foe power control element will not disrupt voltage 
margin testing while it provides response to voltage faults during normal operation of foe 
system. 

With continued reference to FIGURE 4B, foe FBB module 38 can also provide 
masking of temperature fault signals during temperature margin testing of selected 
components offoe computer system under test The computer system, during its normal 
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operaton, may generate and log critical system warning,, increase fan speed, or even initiate 
a power down of the system when one or more monitored temperatures, e.g. the CPU's 
terrrperature monitored by a diode 48, exceed selected thresholds. During temperature margin 
testing, such thresholds axe typically exceeded. Hence, during temperature margin testing, 
the FBB 38 can mask temperature fault signals to ensure that margin testing will proceed 
w^out disruption. For example, the FBB can intercept a temperature interrupt signal 
generated at an output pin of theLM87 designated as THERM*, and can mask this signal 
dunng margin testing of the system. For example, rather than transmitting the intercepted 
raBRM#s.gnal to the power control element 42. the FBB ^transmit another signal, or no 
signal m the case of an interrupt-dnven scheme, to the power control element 42 indicating 
that no temperature fault has occurred 

Wi *~^«WtoFIGU^ 
al^unTized to control the speed ofafm 50. In particular, the FBB receives an output signal 
generated by the fan, namely, the fan's Tach" output, that is mdi^ve of me i^'s speed 
Dunng normal operation of the computer system, the FBB transmits this signal to the LM87 
hanhvaremonito, The LM87 can be programmed to increase the fan's speed when selected 
tcrn^tureduesholds are exceeded. For example, the IMS7 can change the amplitude of a 
a^^bytoDACOut^^^ 

amphfier 52, which powers the fan, in order to increase the fan's speed. During margin 
testrng, it may be desirable to ^able «mtrol signals fix^ me LM87 to me 

selected thresholds, ^f^^^^^lim^.*^^ 
signal, rather than the actual tach signal received from the fan, to indicate that the fan is 
spnuring at full speed even though the actual fan speed has been reduced to lower levels for 
maxgm testing of the system. The simulated tach signal enures that the LM87 will not take 
actrons, forexampie, **^. eo „^^ tetoil ^ a2iiB ^^ 
to mcrease the tan's speed, thereby allowing margin testing to proceed. 

Those having ordinary skil 1 in the art wil 1 appreciate that an I^B module of the 
example, dunng fiequency margin testing, the FBB can be employed to mask system 
margmable components crossing selected thresholds. 
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By way of another example, the FBB can be designed to intervene within the normal 
thermal response mechanisms of an Intel Xeon-class processor. The dual- and multi-class 
Xeon processors include thermal monitoring features, e.g., TCC (thermal control circuitry), 
that allow automatic and/or externally invoked modulation of core clock duty cycle in 
response to high temperature operating conditions, which can be similar to those encountered 
in a margin temperature testing environment The FBB can be programmed to respond to 
such thermal-related processor signals, eg., PROCHOT#, THERMTRIP, etc, in such a way 
so as to disable or to invoke duty cycle modulation -modulation that ihcidentalxy degrades 
performance- to obtain a desired processor response behavior. This can be useful When 
quabfying comrniter-ime^ availability of processing power under 

all supported operating conditions, In production, the EBB can be employed to configure and 
dynamically respond according to thermal rules defined for a given platform, thus allowing 
leverage of design components and connectivity schemes on platforms specified according to 



Exemplary embodiments of the frequency control block and the voltage control block 
willbeprovidedbetow. For example, the following embodiment fflrrstratestheincor^^ 
of a digital frequency synthesizer accordmg to foe teaching 
corrupter system, which employs Intelligent Platform Management Interface (TFMI) 



More particularly, FIGURE 5 schematically illustrates a server computer system 54 
that utilizes industry standard IPMI for implementing in-band and oufcof-band management 
features. The exemplary saver 54 includes a BMC controller 56 that primarily controls in- 
band and out-of-band hardware or software management, such as, monitoring, event logging, 
and errorrecovery. The BMC 56 communicates, via the system interface 16, with the 
server's operating systems, and management agent applications ninning on the server host 
processor. 

The illustrated BMC controller employs a private I ? C (biter-Integrated Circuit) bus 58 
for communication with selected subsystems and components of the server. For example, in 
this exemplary embodiment, the BMC 56 communicates, via the I J C bus 58, with the 
hardware monitor 20 and a serial electrically erasable programmable read-only memory 
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(S^ROM^Otbatcratains^ J( 

should be understood that the BMC 56 can also utilize the I J C bus 58 for communication with 
other internal server modules not shown here. 

The BMC 56 further employs an I J C based Intelligent Platform Management Bus 
(IPMB) to communicate with and manage one or more field replaceable units (FRUs) 
as illustrated FRUs 62 and 64. Tlese FRUs can be intelligent devices, such assatelhte 
management controllers, orpassive devices, such as SEEPROMS. 

With continued reference to FIGURE 5, the exemplary server 54 further includes a 
clock generator66, e.g., a programmable frequency synthesizer, fljatismcorporatedinthe 
server 54 in accordance with the teachings of the invention to communicate with fee BMC 
56. In particular, the exemplary clock generator 66 includes an I 2 C interface 66a that allows 
it. coupling to the I 2 C bus to receive messages from the BMC 56. The illustrated frequency 
syntheazer 66 can receive a reference clock signal, for example, from an internal crystal 
collator 66b, and can generate an output clock signal as a selected multiple of fee input 
reference signal. ^ ontpm clock signal be apphed to margma^ 
68 for margin testing tbereof. 

More Particularly, the BMC 56 can communicate wife fee frequency synthesizer 66 to 
vary its output clock frequency over a number of discrete values within a selected range 
Hus variation of fee output clock frequency can be utilized for frequency margin testing of 
the marginable system components 68. In other words, fee BMC 56 can dynamically issue 
margin control commands to the clock generator to vary its output frequency. 

A variety of I*C configurable integrated circuit clock generators can be employed in 
the practice of the invention for frequency margin testing. Such contemporary clock 
generators advantageously provide high accuracy and internal feedback regulation that render 
feem particularly suitable for frequency margin testing that typically calls for low-Jitter, and 
high-speed clock frequencies. Spread spe^ functionality is also available to help 
mitigate EMI (Elctro-Magnetic Interference) issues. 

B y w *y° f «™Pfc.HGURE6A 
for a generic programmable frequency synthesizer suitable for use in fee practice of fee 
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invention. The clock generator 70 can include an internal crystal oscillator 72 that can 
provide a stable signal at a selected frequency that can be utilized as a reference signal. 
Alternatively, the synthesizer 70 can employ an external reference signal coupled thereto at 
an input port 70a. The exemplary frequency synthesizer 70 further includes an I 2 C interface 
74 that allows communicatiori with an I J C bus, and a register 76 that can store instructions 
received, for example, from the BMC 56 (FIGURE 5). 

A reference signal, generated by the crystal oscillator 72 or provided by an external 
source, is fed into a phase locked loop circuit 78 that generates an output signal at a 
frequency that is a binary multiple of the reference signal based on the instructions stored in 
the register 76. More particularly, the exemplary phase locked loop circuit 78 includes a 
phase detector 80, a low pass filter 82, a voltage controlled oscillator (VCO) 84, and a 
modulo-n divider 86. The divider 86, which is coupled to the register 76, receives an output 
signal of the VCO and generates an output signal at a frequency that is a selected binary 
fraction of the fiequency of fee VCO signal. More specifically, fee instruction stored in fee 
register 76 determines fee binary factor by which the frequency of the divider's output signal 
feffasflomfeatofiteinputagnal.name^ The 
phase detector 78 compares the phase of the divider's output signal with feat of fee reference 
signal, and generates a correction signal based on any measured difrerence that is in turn 
applied, viaalow pass filter 82, to the VCO 84 to shift the VCO'a output fiequency, if 
needed, and ultimately lock fee VCO's output frequency to a desired binary multiple of fee 
reference frequency. In this manner, the frequency synfeesizer generates aii output signal at a 
frequency determined by the instructions received, for example, from the BMC 56 (FIGURE 
5). 

A variety of commercially available programmable frequency synthesizers can be 
employed in fee practice of fee invention. For example, a clock generator suitable for use in 
fee practice of fee invention can be selected to be a programmable phase-locked loop clock 
generator marketed under trade designation FS7140/FS7 145 by AMI Semiconductor of 
Pocatello,ID,U.S.A. 

Wife reference to FIGURES 2 and 6B, in another embodiment, fee FCB module 36 
can be implemented by utilizing a plurality of clock sources, such as clock sources 88, 90, 
and 92, each of which generates a clock signal at a selected frequency. By way of example, 
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the clock source 88 can generate a signal at a frequency of 95 MHz while the clock sources 
90 and 92 can generate signals at 100 MHz and 105 MHz, respectively. In response to 
commands fiom the controller 14, a multiplexer 94, which receives the output of each clock 
source as an input signal, can select and route one of these clock signals to its output as a test 
frequency for application to marginable components of the computer system. Although only 
three clock sources are illustrated in this example, those having ordinary skill in the art will 
apprecate that any number of clock sources can be employed for generating a plurality of 
different test frequencies. 

Withreference to FIGURE 7, some embodiments of the invention provide frequency 
ma^testmgbyut^^ 

clock frequency each of which can be selected in response to an input bit pattern received 
from the controller, eg., BMC. For example, the BMC 56 can supply a 16-bit input to a 
synthesizer 96 in order to select one of me 2 14 frequendes that can be generated by the 
syntfaesizerasitsoumut clock frequency. For frequency margin testing, the BMC 56 can 
apply asequence of bitpatterns to the frequency synthesizer, where each bit pattern instructs 
^svnmesizertogenerateoneofits^ For each output frequency 

the response of the system can then be monitored in a manner described in more detail below. 

In another embodiment schematically depicted in FIGURE 8, an I*C I/O expander 98 
^enmloyedmrsupplyingabitpatternofmputaig^ 
thesyutheWsou^^ 

can conmnmicate withthel'ci/O expander, via the I^C bus 58, to set values of selected 
outputp^oftheexpand^^^ 

output frequency of interest Anumber of conunemialhy available^ I/O expanders can be 
emplov^m^ For example, an I^Cexpa^erchipmanun^ 

rCF8575C can be utilized. 

thete t'^^^^^^^^^'^^^ 
the teachmgs of the invention is preferably implemented such that clock frequencies or power 
^ges^ 

-structed to do otherwise, for example, in response to commands from the controller By 
way of example, with reference »m^*in~n^ l ^ mil ^^ 
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56 communicates, via the r*C bus 58, with the I'C I/O expander 98 whose output is in tun. 
coupled to two multiplexer 1 00 and 102. More particularly, one set of output pins of the I>C 
WO expander 98, herein schematicaUy depicted as signal A, provide one set of input values 
for the multiplexer 1 00 and another set of output pins of the fC-based I/O expander 98 
herein schematically depicted as signal B, provide a set of input values for the other 
multiplexer 102. In addition, the multiplexer 1 00 receives default input signals C from the 
CPU that provide default voltage select signals for VRM type voltage regulators 104, and the 
multiplexer 1 02 receives default input signals D that provide default clock frequency for the 
clock distribution chip 1 06 whose output frequency can be adjusted by a bit pattern of input 
signals applied thereto. 

fafoeabsenceofaagiialapphedto ^ 
output of each multiplexer, and hence the frequencies applied to the clock distribution chip or 
voltage select signals applied to the VRM type regulators, are determined by the default input 
5.gnals,namelysignalsCandD. ***m*mm+m+**^cmlm** 
one or more commands to the I J C I/O expander to set the values of its output pins 
corresponding to signals A and/or B, which provide input signals for multiplexer 100 and 
1 02, respectively. In addition, the controller applies a signal to the SELpin of either, or both, 
multiplexers to cause the multiplexer to route the signals received from the I 2 C I/O expander 
to its output pins. Thus, the output signal of one or both multiplexers changes from default 
values to values dictated by the controller, which in turn causes adjustment of the frequency 
generated by the clock distribution chip 1 00 and/or voltage select signals applied to the 
VRM-type regulators, fa this manner, default clock frequencies and default VRM voltages 
are employed in the absence of contrary instructions from the controller, and margin 
frequency or margin voltage tests are readily accomplMed in response to commands from 
the controller. 

Typically, the level of granularity required for frequency margin testing is not as fine 
as that needed for voltage margin testing. However, programmable clock generation devices 
that prov.de fine frequency resolution are available if the ability to perform precise and 
granular frequency variation is imperative to the completion of a margin test plan. 

A testing system of the invention, such as the above exemplary system, can be 
employed to perform frequency margin testing of various component, of a computer system. 



20 



By way of example, a frequency margin testing system according to the invention can be 
incoipontted into an Itanium Processor Family (IPF) based computer server to provide 
frequency margin testing of the server's front-side bus (FSB) clock frequency. Sucha 
frequency margin testing of the FSB may be desired, for example, when the server's CPUs 
are replaced with CPUs of a new generation. 

With reference to the flow chart of FIGURE 9B, in one embodiment, to perform the 
frequency margin testing of the FSB, in step A, the BMC can be caused to initiate automated 
frequency margin testing of the FSB, eg., a field engineer can issue a command to the BMC 
via a console to cause the BMC to initiate margin testing Upon initiation of margin testing, 
the BMC can cause a frequency synthesizer to apply different frequencies to the FSB over a 
frequency range centered about a nominal FSB clock frequency. For example, the BMC's 
firmware can be pre-programmed to loop through a number of commands transnutted to a 
frequency synthesizer, each of which sets the synthesizer's output frequency to one of a 
Plurality test values. For example, With the main system power off, the BMC. which can be 
powered by a stand-by supply, can transmit a message, via the I 2 C bus, to the digital 
frequency synthesizer to instruct the synthesizer to apply a selected frequency eg a 
frequency of 180 MHz, to the FSB, which runs nominally at a frequency of 200 MHz 
Subsequently, in step B, the BMC will switch on the main power to the server, which causes 
fcesystemtoexecuteitsb^^^^ 

n» BMC monitors the self test. If the test fails, the BMC stores the test results and 
^formation regarding the test point, e.g, test frequency, on non-volatile memory. The BMC 
then switches off the main system power supply (step D), and sends another command to the 
frequency synthesizer to instruct the synthesizer to apply another test frequency e.g a 
frequency of 190 MHz, to the FSB (step B). If the self-test is success*!, the BMC aliows ihe 
boot process to proceed to the stage of loading the operating system, logs the test result, 
swrtches the main power off, and instructs the synthesizer to apply another test frequency to 
the FSB. In this manner, the frequency synthesizer applies a number of different 
frequences within a selected range to the FSB, and the BMC stores the test results 



test 



Upon completion of the test under BMC control, test results can be examined to 
.dentfy failure points, if any, and to provide any necessary trouble-shooting to ensure that the 
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upgraded server will function reliably. Further, the margin test results can be uploaded onto a 
database for reliability/quality analysis. 

Alternatively, with reference to the flow chart of FIGURE 9C, the frequency margin 
testing can be performed in the following manna-, fit step A, the BMC can be instructed to 
set the synthesizer's output frequency to a desired test value. This can be done, for example, 
by an external scripting entity that issues a command to the BMC. A diagnostics software 
can thai be executed, in step B, on the server to obtain information regarding selected aspects 
of the server's operation at this test point. Those having ordinary skill in the art will 
appreciate that such software is commercially available. This information can be analyzed to 
determine whether the server's operations are satisfectory at this test point The information 
can also be recorded, if desired. Subsequently* the BMC can be instructed to adjust the 
synthesizer's output frequency to the next test value (step D), and the above process can be 
iterated until information at all test points are collected and analyzed 

In some embodiments of the invention, a descriptor file can be provided that includes 
a policy for BMC to follow in performing margin testing of the system under test For 
example, such a descriptor file can include parameters associated with a margin test, e.g., 
voltage values for different test points, instructions regarding the steps to be taken in case of 
feihire at a test point, etc. The BMC can gather information regarding the results of a margin 
test, e.g., failure or success of the test, at a particular test point by, for example, reading 
("snooping") data regarding the test results transmitted on a bus, e.g., an RS232 bus, to an 
external terminal, or by communicating with IP MI daemons running on the system's OS. 
Based on the obtained test results and the policy defined m the descriptor file, the BMC can 
take a subsequent action. For example, the test results data may indicate the failure of the test 
at a particular test point, and the descriptor file may indicate that in case of a first failure at a 
test point, the test should be re-run. In such a case, the BMC will reset the test value for 
another execution of the test at the previously failed test point Those having ordinary skill in 
the art will appreciate that a descriptor file can include instructions other than those provided 
above. 

Another operating parameter for which margin testing is typically needed relates to 
voltage applied to various components of a system under test FIGURE 10 schematically 
illustrates incorporation of a voltage margin testing system according to the teachings of the 
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invention in a computer server that employs the IPMI protocol. The exemplary server 108 
includes a BMC controller 56 that provides in-band and out-of-band hardware and software 
management, as described above. As in the above embodiments, in mis exemplary . 
embodiment, the BMC 56 employs a private I 2 C bus 58 for communication with selected 
subsystems and components of the server. 

A digital voltage adjuster 1 10, having an r*C communications interface 1 10a for 
coupling to the I J C bus, is incorporated in the server, in a manner described in detail below, 
to allow voltage margin testing of margmable components of the server. The digital voltage 
adjuster can be implemented as a single integrated circuit, or alternatively, it can be 
implemented as a plurality of integrated circuits. 

The digital voltage adjuster 1 10 is coupled to a voltage regulator 1 12, which receives 
an input voltage and generates a regulated output voltage that can be utilized as a rail voltage 
for application to various components of the server, such as imaginable components 1 14. In 
other words, the voltage regulator 1 12, which can be a linearor a switching regulator, can 
provide a regulated voltage rail for supplying power to various components and modules of 
the server. 

The voltage adjuster 1 10, in response to command signals received from the BMC 
controUer, can affect variation of the regulator's output voltage over a selected range for 
margin testing of one or more components to which such voltage variation is applied. For 
example, the BMC can instruct the digital voltage adjuster 1 10, via commands transmitted ... 
the I 2 C bus 58, to cause variation of the regulator's output voltage, and hence variation of the 
voltage applied to the components 114. For example, the voltage applied to the components 
1 14 can be stepped through a plurality of values within a selected range for performing 
voltage margin testing. 

In one preferred embodiment, the digital voltage adjuster is selected to be a digital 
potentiometer that can function as a digitally controlled variable resistor in a feedback 
resistance network of the voltage regulator 1 12 to adjust the regulator's output voltage. For 
example, with reference to FIGURE 1 1, a digital potentiometer 1 16 can be incorporated in a 
feedback resistance network of a linear regulator 118 to function as an adjustable resistor 
connected in series with another feedback resistor 120 in the regulator's feedback resistance 



on 
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network. The digital potentiometer can vary the resistance of the regulator's feedback circuit, 
thereby adjusting the regulator's output voltage. 

More particularly, with reference to both FIGURE 10 and FIGURE 1 1, the digital 
potentiometer can adjust the resistance in the feedback resistance circuit of the regulator 118 
in response to commands received from the BMC 56, and thus vary the regulator's output 
voltage. This variation of the regulator output voltage can in turn cause variation in the 
voltage of one or more components to which the regulator's output voltage is applied. 
Further, as shown in FIGURE 11, the digital potentiometer 116 can also be utilized to adjust 
the output voltage of a switching regulator 122. 

With continued reference to FIGURES 2, 10 and 1 1, by way of example, before the 
server's primary power source (not shown) is switched on, the external system 24 can 
transmit a command, for example, in the form of Set^VoltageCRail, Value), to the controller 
56 to instruct the controller to set die voltage at a selected rail to a specified value for 
performing margin testing. It is Ihe responsibility of the controller 56 to interpret this 
command into requisite I 2 C messages, and issue the messages accordingly, in order to service 
the command. As such, in response to this command, the controller 56 transmits a command 
to the digital potentiometer 1 16 to adjust its resistance such that the regulator's output voltage 
would be set at an initial value that is slightly below the voltage value specified by the 
external system. For example, the initial value can be less than the specified value by a few 
percents. 

In general, the degree of deviation of the initial voltage value from the specified value 
depends, among other factors, on the tolerance of the digital potentiometer. For example, if 
the full range of the digital potentiometer's resistance tolerance is 5 percent, the initial 
voltage value can be set about 5 percent below the specified value to ensure that the margin 
voltage will not exceed a threshold that would damage the system components. 

Subsequently, the BMC 56 transmits a command to the power control module 42 to 
switch on the system's primary power source. Various implementation of the power control 
module 42, and its communication with the BMC 56, are known to those having ordinary 
skill in the ait The hardware monitor 20 records the regulator's output voltage, and 
communicates the recorded voltage to the BMC. Typically, the voltage read by the hardware 



24 



monitor will be below a tolerable range of the specified value. In such a case, the controller 
56 will re-issue another command to instruct the digital potentiometer to correct the 
regulator's output voltage in the direction of the specified value. Based on a particular 
implementation of the controller's firmware, this voltage calibration cycle may be perfonned 
once, or it may be iterated several times before a sufficiently accurate voltage is read back 
fiom the hardware monitor. 

Upon setting the regulator's output voltage to the desired test value, the controller 56 
can instruct the power control module 42 to switch on the computer system's main power 
source. The system can then execute, for example, its built-in self-test, which can be 
monitored by the controller. This process can be repeated at subsequent test voltagesto 
obtain data regarding the system's response to a plurality of discrete test voltages. 

Although one digital voltage adjuster is shown in the above exemplary embodiments, 
those having or<tinary skill in the art will appreciate that two or more digital voltage adjusters, 
e.g., digital potentiometers, can be utilized in a server, or any other suitable computer system, 
in accordance with the teachings of the invention to adjust voltage variation of different 
voltage rails within the server. Thus, the process of setting rail voltages to test values can be 
performed across multiple component modules to accomplish testing of the computer system 
in an aggregate margm state. Similarly, in the above frequency margin testing embodiments, 
more than one clock frequency can be set at a time for performing aggregate margin 
frewquency testing. 

A variety of digital potentiometers can be employed in the practice of the present 
invention. For example, a quad digitally controlled potentiometer having an I J C interface and 
marketed by Xicor, Inc. ofMilpitas, California under the trade designation X9409 can be 
utilized as a digital voltage adjuster in a voltage margin testing system of the invention. 

Some embodiments of the invention, a feedback signal, for example, from the BMC 
controller, is periodically fed into a digital voltage adjuster, e.g, a digital potentiometer, that 
forms aportion of aresistive feedback circuit of a voltage regulator, as described above, to 
adjust the resistance of the voltage adjuster so as to set the regulator's voltage with a desired 
accuracy to a selected value. For example, FIGURE 1 1 schematically illustrates an 
exemplary implementation of such a feedback mechanism in which the hardware monitor 20 
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receives (he output voltage of the regulator 1 1 8 as an input voltage in order to monitor die 
regulators output voltage. The BMC 56 (FIGURE 10) periodically, for example, once every 
few milliseconds, queries die hardware monitor 20 to obtain the value of the regulator's 
output voltage* If the BMC determines that the regulator's output voltage deviates from a 
desired value by more than a selected threshold, it transmits a command to the digital 
potentiometer 1 16 to adjust the potentiometer's resistance, in a manner described above, so as 
to cause the regulator's output voltage to be at the desired value. This feedback mechanism 
is useful in accurately setting the regulator's output voltage. For example, in some cases, the 
actual resistance of a digital potentiometer can deviate Jrom its nominal resistance by a few 
percent, thereby causing an inaccuracy of a few percent in die regulator's output voltage. 
The above feedback mechanism can be employed to correct such discrepancies between the 
actual and the nominal values of the potentiometer's resistance, and hence improve the 
accuracy of the values of test voltages. 

A voltage margin system according to the invention, such as those described above, 
that incorporates a digital voltage adjuster in a computer system, such as a server, that 
operates under control of a controller internal to the computer system for voltage testing of 
selected components of die computer system provides a number of advantages. For example, 
such a voltage margin testing system is non-invasive in that it does not require utilizing 
jumpers or switches for modifying resistive values of feedback circuitry of voltage regulators 
for adjusting the regulators' output voltages, which can be time-consuming and can adversely 
affect the testing accuracy. Further, £ voltage margin testing system of the invention obviates 
the need for external test equipment, and allows performing voltage testing automatically by 
software control. Moreover, a Voltage testing system of the invention renders voltage testing 
during development, manufacturing, or in the field, practical, thus enhancing product 
reliability. 

Another advantage of a voltage margin testing of the invention is that it facilitates 
root-cause analysis of system failures. For example, in some cases, intermittent failures can 
be made repeatable, and hence more readily diagnosed and corrected, by varying power rails 
voltages. Other advantages of a voltage margin testing of the invention are readily 
recognizable by those having ordinary skill in the art 
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By way of example, a voltage margin testing of the invention can be utilized to test a 
2.5 volt power rail that supplies power to DDR SDRAM DIMMs in a server. Such a test may 
be required, for example, during manufacturing to qualify DIMMs obtained from a new 
DRAM vendor. Such a voltage margin test can be conducted, for example, as follows. 
Initially, the BMC controller can be placed in a special mode, for example, by gamin g 
console access to the BMC and issuing a mode-change command. In this mode, the BMC 
will unlock a command that performs automated voltage margin testing of the DIMM rail 
More particularly, the BMC can vary the voltage of the DIMM rail over a number of values 
(e.g., centered about the nominal voltage value of 2.5 V), each of which corresponds to a test 
point, by issuing commands to the digital potentiometer, as described above. 

The test of the system at one such exemplary test point can be accomplished as 
follows. With the server's main power source oft, the BMC, which can be powered by a 
standby power source, transmits an I 2 C message to the digital potentiometer to cause it to 
adjust its resistance so that the power rail's voltage is at 2.25 V (10% less than the nominal 
voltage). Subsequently, the BMC switches on the server's main power source. The system 
executes its built-in self-test (BIST), which is monitored by the BMC, as part of the early 
boot process. If the BIST fails, the BMC logs the result and information regarding the test 
point, e.g., test voltage, to non-volatile memory, turns off the server's main power source, 
and instructs the digital potentiometer to set the next test voltage, eg., 2.375 vohs. If toe 
BIST is successful, the BMC allows the boot process to proceed to the operating system (OS) 
load stage, logs the success of the test, followed by turning off the main power source, and 
instructing the digital potentiometer to set the next test point After the OS load stage, 
various system-level subsystem stress diagnostics can be executed, either automatically 
through scripted batch calls, or via BMC command messages to the OS agents. Run logs can 
be stored off-system or on local hard disks for later analysis. 

Once all test points are executed, results data can be collected and examined. If there 
are failures at one or more of the test points, the test executor can conduct root-cause analysis 
of the failures. Further, the margin test information can be uploaded into a database for 
reliability/quality analysis. 

In a similar fashion, the above exemplary voltage margin testing can be performed by 
instructing the BMC to set the test voltage to an initial value. A diagnostics software can 
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then be executed on the server to collect information regarding selected operations of the 
server at this test voltage. The information can be analyzed and recorded, or be recorded for 
future analysis. Subsequently, the BMC can be instructed to set a new test voltage, and the 
above process can be iterated to obtain data at all desired test voltages. 

With reference to FIGURE 12, another implementation of an embodiment of a 
voltage margin testing of the invention employs a digital-to-analog converter (DAC) 124 that 
can generate a plurality of voltage output values, such as exemplary outputs A, B, C, and D, 
for voltage margin testing of various power rails of a computer system under test More 
particularly, the DAC 124 can receive a reference voltage from a reference voltage source 
126, and can generate selected output voltage values, for example, in response to commands 
from the BMC 56. In this example, the DAC is selected to be an integrated circuit marketed 
by Analog devices corporation of Norwood, MA, U.S. A under die designation AD5315. The 
DAC 124 can communicate with the BMC 56, via an EC I/O expander switch 128, through 
serial bus lines 130 and 132 to receive instructions for setting one or more of the output 
voltages A-D to selected values for margin testing. Each output voltage of the DAC 124 can 
be coupled, for example, via amplifiers 134, to a switch, such as, switches (e.g., FBTs) 136a, 
136b, 136c, and 136d, herein collectively referred to as switches 136, that can be selectively 
activated via signals from a field programmable gate array (FPGA) 138 to provide a selected 
margin voltage. These switches are used to isolate die trim lines during nominal operation. 
Pull-up resistors 140a, 140b, 140c, and 140d are utilized to ensure that the switches 136 
default to die nominal off state, thus guaranteeing isolation of the DAC analog outputs in case 
of part faults, firmware glitches power resets, etc. Under margin modes, the switches 136 
are turned on and similar FET transmission switches are used to isolate the nominal-mode 
pull-up and pull-down resistors 140 that create appropriate voltage-divided trim inputs during 
nominal operation. In this exemplary embodiment, the margin voltages are selected to be 1.2 
V, 1.5 V, 2.5 V, and 3.3 V. Those having ordinary skill in die art will appreciate that other 
values of margin voltages, and also more than four margin voltages, can be employed. 

Those having ordinary skill in die art will appreciatete that various modifications can 
be made to the above embodiments without departing from the scope of the invention. 
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What is claimed is: 

1 . In an electronic system f 1 08], a system for voltage margin testing of one or more 
components [114] of said system, comprising: 

a controller [56] internal to said electronic system; and 

a digital voltage adjuster [1 10] in communication with said controller [56] and with 
said one or more components [1 14], said voltage adjuster [1 10] affecting generation of one or 
more test voltages for application to said components [ 1 1 4] in response to commands from 
the controller. 

2. The margin testing system of claim 1, further comprising: 

a diagnostics software executing to collect and analyze data regarding a response of 
said system to said lest voltages. 

3. The margin testing system of claim 1 1 further comprising: 

a power rail [44] electrically coupled to said components for applying voltage thereto, 
said voltage adjuster [110] being electrically coupled to said power rail to set said power rail 
voltage to one or more of said test voltages; 

4. The margin testing system of claim 3, further comprising: 

a voltage regulator [1 18] receiving an input voltage and generating a regulated output 
voltage for application to said power rail, said voltage adjuster [110] being coupled to said 
regulator for varying said regulated output voltage in response to commands from said 
controller. 

5. The margin testing system of claim 4, wherein said digital voltage adjuster [110] 
comprises: 

a digital potentiometer [1 16] incorporated in a feedback circuitry of said voltage 
regulator [118], 

wherein the digital potentiometer [116] varies a resistance associated with said 
feedback circuitry in response to commands from said controller so as to vary said output 
voltage of the regulator. 



-29- 



6. The margin testing system of claim 1 , wherein said controller [56] is a Baseboard 
Management Controller (BMC). 

7. A method for voltage margin testing of one or more components of an electronic, 
system [108], having an internal controller [56], and a digital voltage adjuster [1 10], in 
communication with said controller and with at least a power rail [44] supplying voltage to 
said components, comprising: 

causing the controller [56] to transmit one or more commands to said voltage adjuster 
[ II 0] to cause the adjuster to affect generation of one or more test voltages at said power rail 
for application to said components, and 

monitoring response of said computer system to each of said test voltages. 

8. The method of claim 7, further comprising: 
selecting said controller [56] to be a BMC. 

9. The method of claim 8, further comprising: 

employing an l 2 C-based bus [58] for providing communication between said BMC 
and said voltage adjuster. 

1 0. The method of claim 7, further comprising: 

utilizing a hardware monitor [20] to measure voltage at said power rail and 
transmitting said measured voltage to said controller. 
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